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Abstract 

One of the most important challenges in the integration of renewable energy sources into 
the power grid lies in their 'intermittent' nature. The power output of sources like wind and 
solar varies with time and location due to factors that cannot be controlled by the provider. 
Two strategies have been proposed to hedge against this variability: 1) use energy storage 
systems to effectively average the produced power over time; 2) exploit distributed generation 
to effectively average production over location. We introduce a network model to study the 
optimal use of storage and transmission resources in the presence of random energy sources. 
We propose a Linear-Quadratic based methodology to design control strategies, and we show 
that these strategies are asymptotically optimal for some simple network topologies. For these 
topologies, the dependence of optimal performance on storage and transmission capacity is 
explicitly quantified. 

1 Introduction 

It is widely advocated that future power grids should facilitate the integration of a significant 
amount of renewable energy sources. Prominent examples of renewable sources are wind and solar. 
These differ substantially from traditional sources in terms of two important qualitative features: 
Intrinsically distributed. The power generated by these sources is typically proportional to the 
surface occupied by the corresponding generators. For instance, the solar power reaching ground 
is of the order of 2 kWh per day per square meter. The wind power at ground level is of the order 
of 0.1 kWh per day per square meter [I]. These constraints on renewable power generation have 
important engineering implications. If a significant part of energy generation is to be covered by 
renewables, generation is argued to be distributed over large geographical areas. 
Intermittent. The output of renewable sources varies with time and locations because of exoge- 
nous factors. For instance, in the case of wind and solar energy, the power output is ultimately 
determined by meteorological conditions. One can roughly distinguish two sources of variability: 
predictable variability, e.g. related to the day-night cycle, or to seasonal differences; unpredictable 
variability, which is most conveniently modeled as a random process. 

Several ideas have been put forth to meet the challenges posed by intermittent production. The 
first one is to leverage geographically distributed production. The output of distinct generators is 
likely to be independent or weakly dependent over large distances and therefore the total production 



of a large number of well separated generators should stay approximately constant, by a law-of- 
large-number effect. This averaging effect should be enhanced by the integration of different types 
of generators. 

The second approach is to use energy storage to take advantage of over-production at favorable 
times, and cope with shortages at unfavorable times. Finally, a third idea is 'demand response', 
which aims at scheduling in optimal ways some time-insensitive energy demands. In several cases, 
this can be abstracted as some special form of energy storage (for instance, when energy is demanded 
for interior heating, deferring a demand is equivalent to exploiting the energy stored as hot air inside 
the building). 

These approaches hedge against the energy source variability by averaging over location, or by 
averaging over time. Each of them requires specific infrastructures: a power grid with sufficient 
transmission capacity in the first case, and sufficient energy storage infrastructure in the second 
one. Further, these two directions are in fact intimately related. With current technologies, it is 
unlikely that centralized energy storage can provide effective time averaging of -say- wind power 
production, in a renewables-dominated scenario. In a more realistic scheme, storage is distributed 
at the consumer level, for instance leveraging electric car batteries (a scenario known as vehicle- 
to-grid or V2G). Distributed storage implies, in turn, substantial changes of the demand on the 
transmission system. 

The use of storage devices to average out intermittent renewables production is well established. 
A substantial research effort has been devoted to its design, analysis and optimization (see, for 
instance, [2l[3llll[5l[6l[7]). In this line of work, a large renewable power generator is typically 
coupled with a storage system in order to average out its power production. Proper sizing, response 
time, and efficiency of the storage system are the key concerns. 

If, however, we assume that storage will be mainly distributed, the key design questions change. 
It is easy to understand that both storage and transmission capacity will have a significant effect 
on the ability of the network to average out the energy source variability. For example, shortfalls 
at a node can be compensated by either withdrawals from local storage or extracting power from 
the rest of the network, or a combination of both. The main goal of this paper is to understand 
the optimal way of utilizing simultaneously these two resources and to quantify the impact of these 
two resources on performance. Our contributions are: 

• a simple model capturing key features of the problem; 

• a Linear-Quadratic(LQ) based methodology for the systematic design of control strategies; 

• a proof of optimality of the LQ control strategies in simple network topologies such as the 
1-D and 2-D grids and in certain asymptotic regimes. 

• a quantification of how the performance depends on key parameters such as storage and 
transmission capacities. 

The reader interested in getting an overview of the conclusions without the technical details can 
read Sections [2] and |4] only. Some details are omitted and deferred to the journal version of this 
paper. 
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2 Model and Problem Formulation 



The power grid is modeled as a weighted graph G with vertices (buses or nodes) V, edges (hnes) 
E. Time is slotted and will be indexed by t G {0, 1,2, ... }. In slot t, at each node i £ V a 
quantity of energy -Ep,j(t) is generated from a renewable source, and a demand of a quantity Ed^iit) 
is received. For our purposes, these quantities only enter the analysis through the net generation 
Zi{t) = Ep^i{t) — E^^i{t). Let Z{t) be the vector of Zj(t)'s. We will assume that {Z{t)} is a 
stationary process. 

In order to average the variability in the energy supply, the system makes use of storage and 
transmission. Storage is fully distributed: each node i € V has a device that can store energy, with 
capacity Si. We assume that stored energy can be fully recovered when needed (i.e., no losses). At 
each time slot t, one can transfer an amount of energy 3^j(t) to storage at node i. If we denote by 
Bi{t) the amount of stored energy at node i just before the beginning of time slot t, then: 

BiiO) = 0, B^{t + 1) = B,it) + yi{t) (1) 

where 3^i(t) is chosen under the constraint that Bi{t + 1) G [0, Si]. 

We will also assume the availability at each node of a fast generation source (such as a spin- 
ning reserve or backup generator) which allows covering up of shortfalls. Let Wi(t) be the energy 
obtained from such a source at node i at time slot t. We will use the convention that Wi(t) 
negative means that energy is consumed from the fast generation source, and positive means en- 
ergy is dumped. The cost of using fast generation energy sources is reflected in the steady-state 
performance measure: 

°° ' ' i&V 

The net amount of energy injection at node i at time slot t is: 

Zi{t)-yi{t)-Wi{t). 

These injections have to be distributed across the transmission network, and the ability of the 
network to distribute the injections and hence to average the random energy sources over space 
is limited by the transmission capacity of the network. To understand this constraint, we need to 
relate the injections to the power flows on the transmission lines. To this end, we adopt a 'DC 
power flow' approximation model [8]. 

Each edge in the network corresponds to a transmission line which is purely inductive, i.e. 
with susceptance —jb^, where € M+. Hence, the network is lossless. Node i £ y is at voltage 
Vi{t), with all the voltages assumed to have the same magnitude, taken to be 1 (by an appropriate 
choice of units). Let Vi{t) = e-''^'^*^ denote the (complex) voltage at node i in time slot t. If 
Ii,k{i) = —jbik{Vi{t) — Vfc(t)) is the electric current from i to k, the corresponding power flow is 
then j;,fc(t) = Re[V,{t)Ii^k{t)*] = Re[j6,fe(l - e^(Mt)-Mt)))] = b,ksm{Ut) - M^)), where Re[-] 
denotes the real part of a complex number. 

The DC flow approximation replaces sin(</)j(t) — (j)k{t)) by 0i(t) — </'fc(t) in the above expression. 
This is usually a good approximation since the phase angles at neighboring nodes are typically 
maintained close to each other to ensure that the generators at the two ends remain in step. This 

^Despite the name, 'DC flow' is an approximation to the AC flow 
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leads to the following relation between angles and power flow J-i^kit) = biki4'iit) — 4>k{t))- In matrix 
notation, we have 

^(t) = V(^(t), (3) 

where J-{t) is the vector of all power flows, (p{t) = ((/>i(t), . . . and V is a \E\ x \V\ matrix. 

Ve,j = 6e if e = (z, k) for some k, V e,i = —he if e = {k, i) for some /c, and Ve,j = otherwise. 
Energy conservation at node i also yields 

k 

where b = diag(6e) is an \E\ x \E\ diagonal matrix. Expressing J-{t) in terms of (j){t), we get 

Z{t)-y{t)-W{t) = -A^{t), (4) 

where A = — V"^b~^V is a \V\ x \V\ symmetric matrix where Aj^^ = — z)g_E if z = A:, 
^i,fe = bik if {i, k) £ E and otherwise. In graph theory, A is called the graph Laplacian matrix. 
In power engineering, it is simply the imaginary part of the bus admittance matrix of the network. 
Note that if 6e ^ for all edges e, then —A is positive semidefinite. If the network is connected 
(which we assume throughout), it has only one eigenvector with eigenvalue 0, namely the vector 
(pv = 1 everywhere (hereafter we will denote this as the vector 1). This fits the physical fact that 
if all phases are rotated by the same amount, the powers in the network are not changed. 

With an abuse of notation, we denote by A^-^ the matrix such that A^^l = —M 1, and A^^ is 
equal to the inverse of A on the subspace orthogonal to 1. Here M > is arbitrary, and all of our 
results are independent of this choice (conceptually, one can think of M as very large). Explicitly, 
let A = — Va^V"^ be the eigenvalue decomposition of A, where a is a diagonal matrix with non- 
negative entries. Define to be the diagonal matrix with = M if an = and aj- = a^-^ 
otherwise. Then A"^ = -V{a'^f\''^. 

Since the total power injection in the network adds up to zero (which must be true by energy 
conservation), we can invert (jl]) and obtain 

cP{t) = -A-Hz{t)-y{t)-w{t)). (5) 

Plugging this into ([3]), we have 

J^{t) = - VA-i {Z{t) - y{t) - W{t)) . (6) 

There is a capacity limit Ce on the power fiow along each edge e; this capacity limit depends 
on the voltage magnitudes and the maximum allowable phase differences between adjacent nodes, 
as well as possible thermal line limits. We will measure violations of this limit by defining 

^ hm -LY^E{{Teit) - Ce)+ + {-Ce - Te{t))+} . (7) 

' ' eG-E 

We are now ready to state the design problem: 

For the dynamic system defined by equations and design a control strategy which, given 
the past and present random renewable supplies and the storage states, 

{{Z{t),B{t)y, {Z{t-l),B{t-l)),...,} 

choose the vector of energies y{t) to put in storage and the vector of fast generations W(t) such 
that the sum etot = ej" + cf. Eq. (0) and is minimized. 
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3 Linear- Quadratic Design 

In this section, we propose a design methodology that is based on Linear-Quadratic (LQ) control 
theory. 

3.1 The Surrogate LQ Problem 

The difficulty of the control problem defined above stems from both the nonlinearity of the dynamics 
due to the hard storage limits and the piecewise linearity of the cost functions giving rise to the 
performance parameters. Instead of attacking the problem directly, we consider a surrogate LQ 
problem where the hard storage limits are removed and the cost functions are quadratic: 



Bi{0) = ^, B,{t + l) = Bi{t) + Yi{t), (8) 

F{t) = -VA-\Z{t) - Y{t) -W{t)) , (9) 

with performance parameters: 

^surrogate ^ lin,E{(M^,(t))'}, iGV, (10) 

^surrogate ^ E{(Fe(t))2}, cGE, (11) 

surrogate ^ E{{B,{t)f} . (12) 



The process Bi{t) can be interpreted as the deviation of a virtual storage level process from the 
midpoint Si/2, where the virtual storage level process is no longer hard-limited but evolves linearly. 
Instead, we penalize the deviation through a quadratic cost function in the additional performance 

; surrogate 

parameters s^. 

The virtual processes B{t), F{t), W{t), Y{t) and Z{t) are connected to the actual processes 
B{t), T{t), W(i), y{t) and Z{t) via the mapping (where [x]^ := max(min(a;, 6), a) for a < b): 



Zi{t) = Zi{t), Fe{t) = F,{t), (13) 

Bi{t) = [B,{t)+Si/2]l^ , (14) 

yi{t) = Bi{t + l)-B,{t), (15) 

W,{t) = Wi{t)+Yi{t)-yi{t). (16) 



In particular, once we solve for the optimal control in the surrogate LQ problem, (llSp and (jl6p tell 
us what control to use in the actual system. Notice that the actual fast generation control provides 
the fast generation in the virtual system plus an additional term that keeps the actual storage level 
process within the hard limit. Note also 

Wi(t) > W,{t) - {Bi{t) - Si/2)+ - {-Bi{t) - 5,/2)+ . (17) 

Hence the performance parameters ej^, ey^ can be estimated from the corresponding ones for the 
virtual processes. 

Now we turn to solving the surrogate LQ problem. First we formulate it in standard state- 
space form. For simplicity, we will assume {Z{t)}t>o is an i.i.d. process (over time)ll Hence 

^The case of a process {Z{t)^t>o with memory can be in principle studied within the same framework, by intro- 
ducing a hnear state space model for Z{t) and correspondingly augmenting the state space of the control problem. 
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X{t) := [F{t - ly , B{ty Y is the state of the system. Also, U{t) := [Y{ty , W{ty Y is the control 
and R{t) := [X (t)'^ , Z (t)'^Y" is the observation vector available to the controller. Then 



X{t + 1) 
Rit) 



AX{t)+'DU{t) + BZ{t), 
CX{t) + at), 



(18) 
(19) 



where 




I 



D 



I 

















and C 




I 



and C{t) 



Z[t). We are interested in trading off between the performance 



parameters eFe-,£Wt and e_B. 's. Therefore we introduce weights 7e's , ^j's, r/j's and define the 
Lagrangian 



\E\ \V\ \V\ 

e=l i=l 1=1 

= E{X{tfQiX{t) + UitfQ2U{t)} , 



(20) 



where Qi = diag(7i, . . . , 7|_^, ^i, . . . , ^H/|) and Q2 = diag(0, . . . , 0, r/i, . . . , r?|£;|). 

We win let E{Z(t)} = Z. We will also assume that T^z = E[Z{t)^Z{t)] = I, since if not, then 
we can define E = [VA^^-y/S^ ^0]-^, where \/T,z is the symmetrical square root of S^. 

An admissible control policy is a mapping {R{t),R{t — 1), . . . ,i?(0)} 1— )• U{t). The surrogate 
LQ problem is defined as the problem of finding the mapping that minimizes the stationary cost 
£ = limt_^oo£(i)- 

Notice that the energy production-minus-consumption Z(t) plays the role both in the evolution 
equation (fTH]) and the observation (fT9|) . The case of correlated noise has been considered and solved 
for general correlation structure in [9|. Let G = E[C{t)Z{t)'^] = [lOf, Ri{t) = [{Z{t) -Z)'^,Of 
and R2{t) = [0, B{t)^Y^ . Adapting the general result in ^ to our special case, we have 

Lemma 3.1. The optimal linear controller for the system in (|18p and p9|) and the cost function 
in (|20p is given by 

U{t) = -(Lfli(t) + K-iD^SEG^M-ii?2(i)) + U, (21) 
where, letting rj = diag(r7i, . . . ,r/|v|) o,nd 7 = diag(7i. . . . ,7|y|).' 

U = 



Y 
W 







[/ - A(V^7V)-iAr/] ^Z 
and S is given by the algebraic Riccati equation 



(22) 



S = A^SA + Qi - L^KL, 



(23) 
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where K = D^SD + Qi'Q2, L = K-i(D^SA + Q^Qi), and 

(24) 



M = CJC^ + 



I 





where J satisfies the algebraic Riccati equation J = AJA-^ + EE'^ — OMO-^, and O = (AJC^ + 
EG'^)M-i. 

Note that the optimal hnear controher has a deterministic time-invariant component U = 

[F^,TF^]^ and an observation-dependent control -(Li?i(t) + K~^D^SEG'^M~^i?2(i))- It is intu- 
itive that Y = 0, since otherwise the storage process has a non-zero drift and will become unstable. 
The deterministic component W for the fast generation can be seen to be the solution of a static 
optimal power flow problem with deterministic net renewable generation Z and cost function given 
by: 

\E\ _ \V\ _ 

e=l i=l 

On the other hand, the observation-dependent control is obtained by solving the LQ problem with 
the net generations shifted to zero-mean. Thus, the LQ design methodology naturally decomposes 
the control problem into a static optimal power flow problem and a dynamic problem of minimizing 
variances. 

Notice that it might be also convenient to consider more general surrogate costs in which a 
generic quadratic function of the means Fg , Wi is added to the second moment Lagrangian (l20|) . 

3.2 Transitive Networks 

Lemma |3. II gives an expression for the optimal linear controller. However, it is difficult in general 
to solve analytically the Riccati equation. To gain further insight, we consider the case of transitive 
networks. 

An automorphism of a graph G = {V, E) is a one-to-one mapping f : V ^ V such that for 
any edge e = {u,v) G E, we have e' = {f{u),f{v)) £ E. A graph is called transitive if for any 
two vertices vi and V2, there is some automorphism f : V ^ V such that /(fi) = V2- Intuitively, 
a graph is transitive if it looks the same from the perspective of any of the vertices. Given an 
electric network, we say the network is transitive if it has a transitive graph structure, every bus 
has the same associated storage, every line has the same capacity and inductance, and Zi{t) is 
i.i.d. across the network. Without loss of generality, we will assume Si — 5*, — C, b^ — 1, 
E[Zj(t)] = fi,Yar[Zi{t)] = . Since the graph is transitive, it is natural to take the cost matrices 
as Qi = diag(7, . . . , 7, ^, . . . , ^) and Q2 = diag(0, . . . , 0, 1, . . . , 1). Moreover, it can be seen from 
Eq. (|22p that C/ = 0. Since the mean net production is the same at each node, the static optimal 
power flow problem is trivial with the mean flows being zero. We are left with the dynamic variance 
minimizing problem. 

Recall that A = — Vo'^V^ is the eigenvalue decomposition of A. Since A = -V^V, the 
singular value decomposition of V is given by V = UaV"^ for some orthogonal matrix U. The 
basic observation is that, with these choices of Qi and Q2, the Riccati equations diagonalize in the 
bases given by the columns of V (for vectors indexed by vertices) and columns of U (for vectors 
indexed by edges). 
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A full justification of the diagonal ansatz amounts to rewriting the Riccati equations in the new 
basis. For the sake of space we limit ourselves to deriving the optimal diagonal control. We rewrite 
the linear relation from X(t) to U{t) as 



Y{t) = HZ(t)-K5(t), 
W{t) = PZ{t) + QB{t). 

Substituting in Eq. (fTSi) . we get 

B{t + 1) = (I-K)S(t) +HZ(t), 



F{t + 1) = VA-^{(I - H - P)Z{t) 
W{t) = PZ{t) + QB{t). 



(K 



Q)i?(t)} 



(25) 
(26) 

(27) 
(28) 
(29) 



Denoting as above by B, F, W the average quantities, it is easy to see that, in a transitive network, 
we can take F = 0, W = and hence B = 0. In words, since all nodes are equivalent, there is 
no average power flow (F = 0), the average overproduction is dumped locally {W = /x), and the 
average storage level is kept constant {B = 0). 

We work in the basis in which V = UaV^ is diagonal. We will index singular values by G 
hence a = diag({a(0)}ege) (omitting hereafter the singular value a = since the relevant quantities 
have vanishing projection along this direction.) In the examples treated in the next sections, 6 will 
be a Fourier variable. Since the optimal filter is diagonal in this basis, we write K = diag(/c(0)), 
H = diag(/i(0)) and P = diag(p(0)), Q = dmg{q{9)). _ _ _ 

We let bg{t), ze{t), fe{t), wg{t) denote the components oiB{t)-B, Z{t)-^, F{t)-F, W{t)-W 
along in the same basis. From Eqs. ([271) to (f29]l . we get the scalar equations 



beit) 
feit) 



Wg(t) 



{l-k{9))be{t-l) + h{e)ze{t), 
-a-He){il - h{e) - pi9))ze{t) ^ 

{k{0)-qi9))be{t-l)} 

V{e)ze{t)+q{e)be{t-l). 



(30) 

(31) 
(32) 



We will denote by <j'^{9), crp[9), a^,{9) the stationary variances of bo^t), fe{t), W0{t). From the 
above, we obtain 



(1- 
(1 



A;)2 ' 
-h-pf 



+ 



h'^{k-qf 
l-{l-kf 



a 



1 



a 



(33) 
(34) 
(35) 



(1 - A;)2 

(We omit here the argument 9 on the right hand side.) 

In order to find h,k,p,q, we minimize the Lagrangian (j2Up . Using Parseval's identity, this 
decomposes over 9, and we can therefore separately minimize for each 9 £ Q 



C{9) = aw{0f + e MGf + 7 MGf ■ 
A lengthy but straightforward calculus exercise yields the following expressions. 



(36) 
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No transmission (C = 0) 



No storage {S = 0) 



Storage and Transmission 



0(^) for /iS < a' 

(Texp| — ^1 otherwise 



0(^) for iiC < a= 

o"Cxp| — otherwise 



o"exp|-§}^ for At = exp|-a;(^§)} 
o-expj-S^j for ;i = exp|-o(^£)} 



r Cmax(C,5)l 

'^cxpl 



Table 1: Asymptotically optimal ew + ejF in 1-D and 2-D grids. Logarithmic factors have been neglected 
(also in the exponent), f indicates the lower bound requires a conjecture in probability theory. 



Theorem 1. Consider a transitive network. The optimal linear control scheme is given, in Fourier 
domain E B, by 



where /3{9) is given by 



p{e) = q{e) = i ^ ^ , (37) 

_ 2m + 1 - V4/3(g) + 1 

^(^^ me) ' ^ ^ 



It is useful to point out a few analytical properties of these filters: (i) 7/[C(7 + c^max)] < /? < 1/C 
with dmax the maximum degree in G] (ii) < A; < 1 is monotone decreasing as a function of /3, 
with /c = 1 - /3 + 0(/32) as /3 ^ and /c = 1/V^ + 0(l//3) as /3 oo; (iii) < /i < 1 is such that 
h + k = 1. In particular, it is monotone increasing as a function of /3, with h = j3 + 0{/3'^) as /3 — )• 
and = 1 - l/V? + 0(l//3) as /3 ^ oo; (zu) p = q = ipk. 

Theorem 2. Consider a transitive network, and assume that the optimal LQ control is applied. 
The variances are given as follows in terms of k{6), given in Eq. \38^) : 



al{9) _ {l-k{e)y 



cj2 i_(i_A:(0))2 ' 



(j2 (^ + a2(0))2 i_(i_A;(e))2 ' 

a2 (^ + a2(0))2 i-(l-A;(0))2 ■ 

4 1-D and 2-D Grids: Overview of Results 



(41) 
(42) 
(43) 



For the rest of the paper, we focus on two specific network topologies: the infinite one-dimensional 
grid (line network) and the infinite two-dimensional grid. We will assume that the net generations 
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are independent across time and position, with common expectation EZj(t) = /i, and we will place 
weak assumptions on the distributions (to be specified precisely later in Section [6j) We will focus 
on the regime when the achieved cost is small. In Section [5] we will evaluate the performance of the 
LQ scheme on these topologies. In Section [71 we will derive lower bounds on the performance of 
any schemes on these topologies to show that the LQ scheme is optimal in the small cost regime. 
As a result, we characterize explicitly the asymptotic performance in this regime. The results are 
summarized in Table [TJ 

Although the i.i.d. assumption simplifies significantly our derivations, we expect that the quali- 
tative features of our results should not change for a significantly broader class of processes {Zi{t)}t- 
In particular, we expect our results to generalize under the weaker assumption that Zi{t) is sta- 
tionary but close to independent beyond a time scale T = 0(1). 

The parameter //, the mean of the net generation at each node, can be thought of as a measure 
of the amount of over-provisioning. Let us first consider that case of a one-dimensional grid and 
assume that fi is vanishing or negligible. In other words, the average production balances the 
average load. Our results imply that a dramatic improvement is achieved by a joint use of storage 
and transmission resources. Consider first the case C = 0. The network then reduces to a collection 
of isolated nodes, each with storage S. It can be shown that the optimal cost decreases only slowly 
with the storage size S, namely as 1/S. Similarly, when there is only transmission but no storage, 
the optimal cost decreases only slowly with transmission capacity C, like 1/C. On the other hand, 
with both storage and transmission, the optimal costs decreases exponentially with V CS. Consider 
now positive over-provisioning ^ > 0. When there is no storage, the only way to drive the cost 
significantly down is at the expense of increasing the amount of over-provisioning beyond cr^/C The 
same performance can be achieved with a storage S equalling to this amount of over-provisioning 
and with the actual amount of over-provisioning exponentially smaller. 

The 2-D grid provides significantly superior performance than the 1-D grid. For example, the 
cost exponentially decreases with the transmission capacity C even without over-provisioning and 
without storage. The increased connectivity in a 2-D grid allows much more spatial averaging of 
the random net generations than in the 1-D grid. In order to understand the fundamental reason 
for this difference, consider the case of vanishing over-provisioning fi = and vanishing storage 
5 = (also see Figure [1]). Consider first a 1-D grid. The aggregate net generation inside a segment 
of I nodes has variance la"^ and hence this quantity is of the order of ^Aa. This random fluctuation 
has to be compensated by power delivered from the rest of the grid, but this power can only be 
delivered through the two links, one at each end of the segment and each of capacity C. Hence, 




Figure 1: Boxes of side I in 1-D and in 2-D. The ratio boundary/ V volume is G(l/\//) in 1-D, and 
9(1) in 2-D. 
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successful compensation requires I < C'^/a^. One can think of := C'^ ja^ as the spatial scale 
over which averaging of the random generations is taking place. Beyond this spatial scale, the 
fluctuations will have to be compensated by fast generation. This fluctuation is of the order of 
\fllo jl^ = a'^/C per node. Note that a limit on the spatial scale of averaging translates to a large 
fast generation cost. In contrast, in the 2-D grid, {i) the net generation, and (ii) the total link 
capacity connecting an / x / box to the rest of the grid, both scale up linearly in I. This facilitates 
averaging over a very large spatial scale /, resulting in a much lower fast generation cost. 

There is an interesting parallelism between the results for the 1-D grid with storage and the 2-D 
grid without storage. If we set S = C, the results are in fact identical. One can think of storage as 
providing an additional dimension for averaging: time (Section 17.21 formalizes this). Thus, a one 
dimensional grid with storage behaves similarly to a two-dimensional grid without storage. 



5 Performance of LQ Scheme in Grids 

In this section we evaluate the performances of the LQ scheme on the 1-D and 2-D grids. Both are 
examples of transitive graphs and hence we will follow the formulation in Section 13.21 For these two 
examples, the operator A is in fact invariant to spatial shifts so the 0-domain which diagonalizes 
the operator is simply the (spatial) Fourier domain. 

For simplicity, in this section, we consider the case where Zi{t) are gaussian. In the next section 
we show how all our results immediately generalize to a much larger and more realistic class of 
distributions. 

Suppose that Zi{t) ~ N(/i,(7^) iid across nodes and time. It follows that B,F,W are Gaussian, 
and using Eq. (IITD . we get the following estimates 



(44) 



Here F is the tail of the Gaussian distribution F(z) = j'^ <^{x)Ax = <!>(— z), where (f){x) = 
exp{— x^/2}/\/27r the Gaussian density and ^{x) = (f>{u)du is the Gaussian distribution. 

In order to evaluate performances analytically and to obtain interpretable expressions, we will 
focus on two specific regimes. In the first one, no storage is available but large transmission capacity 
exists. In the second, large storage and transmission capacities are available. 



5.1 No storage 

In order to recover the performance when there is no storage, we let ^ — )• oo, implying o"^ — )• by 
the definition of cost function (j36p . In this limit we have /3 — )• 0, cf. Eq. (j40p . Using the explicit 
formulae for the various kernels, cf. Eqs. (i37|l to (f39l) . we get: 

1 = ^^2(n^ + 0(1/0 , h = 0(1/0 ,k = l- 0(1/0 . 

7 -I- a^[p) 

Substituting in Eqs. (j27|) to ([28|l we obtain the following prescription for the controlled variables 
(in matrix notation) 

y(t) = W{t)= 7(-A + 7)-iZ(t), (45) 
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while the flow and storage satisfy 

B{t) = 0, F{t) = V{-A + j)-^Z{t), (46) 

The interpretation of these equations is quite clear. No storage is retained {B = 0) and hence no 
energy is transferred to storage. The matrix 7(— A + 7)^^ can be interpreted a low-pass filter and 
hence 7(— A-|-7)~^Z(t) is a smoothing of Z(t) whereby the smoothing takes place on a length scale 
^-1/2^ The wasted energy is obtained by averaging underproduction over regions of this size. 

Finally, using Eqs. (j42p and (j43p . we obtain the following results for the variances in Fourier 
space 

apie)^ ^ a\9) awie? ^ 7' 

5.1.1 One-dimensional grid 

In this case 6 € [— 7r,7r], and a{6)'^ = 2 — 2cos0 (the Laplacian A is diagonalized via Fourier 
transform). The form of the optimal filter P is shown in Figure El 

The Parseval integrals can be computed exactly but we shall limit ourselves to stating without 
proof their asymptotic behavior for small 7. 

Lemma 5.1. For the one- dimensional grid, in absence of storage, as 7 — )■ 0, the optimal LQ control 
yields variances 

4 = aV4V7 {1 + 0(7)} , crl, = a^^/A[l + 0(7)} . 

Using these formulae and the equations (j44p for the performance parameters, we get the follow- 
ing achievability result. 

Theorem 3. For the one- dimensional grid, in absence of storage, the optimal LQ control with 
Lagrange parameter 7 = [i^ jC^ yields, in the limit ^/C — )• 0, /a"^ — )• oo.' 

etot<exp{- ^(1 + 0(1))}. (47) 

The choice of 7 given here is dictated by approximately minimizing the cost. In words, the cost 
is exponentially small in the product of the capacity, and overprovisioning /xC. This is achieved by 
averaging over a length scale 7~^/2 = (j j ^ that grows only linearly in C and 1/^. Note that the 
extent of averaging is limited by the transmission capacity C: the larger the extent of averaging, 
the larger the amount of power which has to be transported across the network. Optimal filters P 
for two different values of 7~^/2 g^j.g displayed in Figure [2l 

5.1.2 Tvi^o-dimensional grid 

In this case 9 = {9i, ^2) S [— ^r, 7r]2, and a{6)'^ = 4 — 2 cos 9i — 2 cos ^2- Again, we evaluate Parseval's 
integral as 7 — )• 0, and present the result. 
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Figure 2: The filter P for a one-dimensional grid % in the case in which no storage is available. 
Notice that by translation invariance Pjj = Pi-j for any i,j E Z, and further P„ = P_„. Here we 
plot Pn for two values of the effective length scale 1 / 
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Lemma 5.2. For the two-dimensional grid, in absence of storage, as 7 — )• 0, the optimal LQ control 
yields variances 



Using these formulae and the equations (j44p for the performance parameters, and approximately 
optimizing over 7, we obtain the following achievability result. 

Theorem 4. For the two-dimensional grid, in absence of storage, the optimal LQ control with 
Lagrange parameter 7 = {ji^ /C'^)\og{C'^ / li^e) yields, in the limit [ijC — )• 0, (^^/(fT^ log(C/^)) = 
M 00; 

^-°-^--4" c.Mog%v.2,) (i+'^a))} • m 

Notice the striking difference with respect to the one-dimensional case, cf. Theorem [31 The cost 
goes exponentially to 0, but now overprovisioning plays a significantly smaller role. For instance, 
if we fix the link capacity C to be the same, the exponents in Eq. (j47p are matched if /i2d ~ 
exp(— 7rC/2;Uid)}, i.e. an exponentially smaller overprovisioning is sufficient. 

5.2 With Storage 

In this section we consider the case in which storage is available. Again we focus on the regime 
where the optimal cost is small. Within our LQ formulation we want therefore to penalize am 
much more than ob and ap- This corresponds to the asymptotics 7— )-0, ^ = 7/5— )-0 (the ratio s 
need not to be fixed). It turns out that the relevant behavior is obtained by considering c? = 6(7) 
and hence /? — >• 00. The linear filters are given in this regime by 

piO)=q{0) = {7/Vs) {^ + a{ef)-'/\ 

kie)^{l/V~s){7 + ai9)Y\ hi9)^l. 



Using these filters we obtain 



apie? a\e) { 1 ^ 



ct2 2^ V7 + a(^)^ 



a2 2^V7 + aW^ 

5.2.1 One-dimensional grid 

The variances are obtained by Parseval's identity, integrating cr^ w F^O) over € [— vr, vr]. The form 
of the optimal filters P and K is presented in Figure [3l We obtain the following asymptotic results. 
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12 12 

10 10 10 10 10 10 

iog(n) log(n) 



Figure 3: The filters P, K for a one-dimensional grid TL in the case in which storage is available. 
Notice that by translation invariance Pjj = P%-j^ Kjj- = i^i-j for any G Z, and further 
Pn = P-n, Kn = K^n- Here we plot the filters for effective length scale: l/y/j = 30 and S = C 
(top panels) or 5 = C/4 (lower panels). 
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Lemma 5.3. Consider a one- dimensional grid, subject to the LQ optimal control. For 7 — )• and 
^ = 7/s — ;> 

4 = # i°g - + Oil) , 4 = i°g - + 0(1, s-') , 

C7^ 47r 7 o"^ 4:^T^/s 7 



1 ^ + 0(72,73/2/5) 



a2 2^ 

where 0,^ 'is the integral (here d'^u = dui x • • • x duj_) 

nd = l/{2Tr)'^ f l/{l + \\uff/^d'^u. (49) 
Using Eqs. (|44p to estimate the total cost etot and minimizing it over 7 we obtain the following. 



Theorem 5. Consider a one- dimensional grid and assume CS/a"^ — )• 00. The optimal LQ scheme 
achieves the following performance: 



1^ 



1^ 



etot <exp| - y^^(l + o(l))}, 

r 7rC75(l + o(l))-i 
2a2logC/^ I' 



under the further assumption \/T^CSj2a^ —\og{C / S) 00 (in the first case) and log(C//i)/ min(C, 5)^ 
(in the second). In the first case the claimed behavior is achieved by s = S'^/AC'^, and 7 = 
exp{-(2^C5/a2)V2}. second by letting s = S^/AC^, and 7 = /x^ log(C//x)/(7rl7iC2). 

This theorem points at a striking threshold phenomenon. If overprovisioning is extremely 
small, or vanishing, then the cost is exponentially small in V OS. On the other hand, even a 
modest overprovisioning changes this behavior leading to a decrease that is exponential in CS" 
(barring exponential factors). Overprovisioning also reduces dramatically the effective averaging 
length scale 7~^/2_ ^Iso instructive to compare the second case in Theorem [5] with its analogue 
in the case of no storage, cf. Eq. (I47p : storage seem to replace overprovisioning. 



5.2.2 Two-dimensional grid 

As done in the previous cases, the variances of B, F, W are obtained by integrating cr^ fw(^) 
e = {01,92) G [-^,vr]2. 

Lemma 5.4. Consider a two-dimensional grid, subject to the LQ optimal control. For 7—7-0 and 
s = 0(1), we have 

2 2 
^ = Gb{s) + 0(1, V7) , ^ = Gf{s) + 0( Vt) , 

where Q2 is the constant defined as per Eq. ^9^ , and Gsis), Gp{s) are strictly positive and bounded 
for s bounded. Further, as s —)• 00 Gb{s) = /C2\/s/2 + 0(1), Gf{s) = /C2/(2y^) + 0(l/s), where 
^2 = /[_^,^]2 1^ d9. 
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Minimizing the total outage over 5,7, we obtain: 



Theorem 6. Assume CS/a"^ — >• co and C/S = Q{1)- The optimal cost for scheme a memory-one 
linear scheme on the two-dimensional grid network then behaves as follows 

( CS 1 

^"^^'">{-2;;wo(i+°(i))}' (50) 

Here u 1— t- T{u) is a function which is strictly positive and hounded for u bounded away from and 
00. In particular, T{u) — )• /C2 as u ^ 00, and T{u) = Tqu + o{u) as u ^ (Tq > 0^. 

The claimed behavior is achieved by selecting s = f{S/C), and^ as follows. If n = exp{— o(CS'/(T^ 
then-i = f{S/C){iJ^/CSf/^. If instead ^l = e^-p{-uj{C S / a"^)} , then-r = e^p{-2CS/{3T{S/C)a^)}, 
for suitable functions f,f (In the first case, we also assume fi/C — t- 0.) 

The functions T,f,f in the last statement can be characterized analytically, but we omit such 
characterization for the sake of brevity. As seen by comparing with Theorem \5\ the greater con- 
nectivity implied by a two dimensional grid leads to a faster decay of the cost. 



6 Extension to a larger class of distributions 

We find that our results from Section [5] immediately generalize to a much broader class of distri- 
butions for Zi{t) than Gaussian. 

To define this class, first we provide the definition of sub-Gaussian random variables. (See, for 
instance, [10] for more details.) 

Definition 6.1. A random variable X is sub-Gaussian with tail parameter if, for any A G M, 
Two important examples of sub-Gaussian random variables are: 

1. Gaussian random variables with variance are sub-Gaussian with tail parameter = 0"^. 

2. Random variables with bounded support on [a, b] are sub-Gaussian with tail parameter s"^ = 
{h-af/A. 

Notice that the tail parameter is always an upper bound on the variance cr^, namely cx^ < (this 
follows by Taylor expansion of Eq. (|5ip for small A). We will consider the class of distributions for 
the net production Zi{t) to be sub-Gaussian with tail parameter of the same order as the variance. 
More precisely: 

Definition 6.2. Fix constant k > 0. Let S{k) be the class of distributions such that the sub- 
Gaussian tail parameter and the standard deviation satisfy: 

s^ 

1 < ^ < K. 
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This is a natural class of distributions for modeling renewable power production. The power 
generated by wind turbines, for example, are bounded between and a upper power limit, with 
significant probability that the power is near or capped at the upper limit. Hence, it is of 
bounded support with the range comparable to the standard deviation. It is sub-Gaussian with 
tail parameter of the same order as the variance. 

We will now argue that all the results we derived in Section [5] for Gaussian net productions 
extend to this class of distribution. The only fact we used to connect variances with the costs, 
where we used the Gaussianity assumption, is Eq. (j44p . This equation implies that decreases 
exponentially with iCjap)'^^ and e>v decreases exponentially with {SjoBf' and with (/u/ai^)^, 
which in turns leads to Theorems O [H [5l [H We will show that these exponential dependencies hold 
for distributions in as well, and a similar versions of these theorems hold for these distributions. 

First we need some elementary properties of sub-Gaussian random variables. The first property 
follows by elementary manipulations with moment generating functions. 

Lemma 6.3. Assume Xi and X2 to be independent random variables with tail parameters sf and 
82- Then, for any ai, 02 € M, X = aiXi + 02^2 is sub-Gaussian with tail parameter (af sf + a^sl). 

Notice that by this lemma, the parameters of sub-Gaussian random variables behave exactly as 
variances (as far as linear operations are involved). In particular, it implies that the class S{k) is 
closed under linear operations. 

The second property is a well known consequence of Markov inequality, and shows that the tail 
of a sub-Gaussian random variable is dominated by the tail of a Gaussian with the same parameter. 

Lemma 6.4. If X is a sub- Gaussian random variable with parameter s^, then, for any a > 
¥{X >a + KX},¥{X <-a + EX} < exp{-a'^/{2s^)}. 

Now suppose the net productions Zj(t)'s have distributions in S{k). The LQ scheme developed 
in Section [3] implies that the controlled variables Bi{t), Fe{t), Wi{t) are linear functions of the net 
productions Zi(t), and hence it follows that Bi(t), Fe{t), Wi{t) are in S{k). Now, if we let be 
the flow at edge e at steady-state, with sub-Gaussian tail parameter Sp, then 



Here, as in Eq. (j44p . F( • ) is the complementary cumulative distribution function of the standard 
Gaussian random variable: F(x) = 1 — ^>(x). Similarly, one can show that: 



E{(j;-c)+ + (-c- j-e)+} 




00 r-C 
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Thus, £j decreases exponentially with (C japf'^ and e>v decreases exponentially with [SjaB)^ 
and with (/x/cjvk)^) as in the Gaussian case, except for an additional factor of 1/k in the exponent. 
This means that analogues of Theorems El SJ El E] also hold for distributions in with an 

additional factor of 1/k in the exponent of etot- Note that all these exponents are proportional to 
where is the variance of the Gaussian distributed Zi{t). Therefore, equivalently, one can 
say that the performance under a sub-Gaussian distributed Zi{t) with tail parameter is at least 
as good as if Zi{t) were Gaussian with variance s^. 



7 Performance limits 

In this section, we prove general lower bounds on the outage etot = £>v + of (^ny scheme, on 
the 1-D and 2-D grids. Our proofs use cutset type arguments. Throughout this section, we will 
assume the Zi{t) to be i.i.d. random variables, with Zi{t) ~ N(/x, cj^), with the exception of the case 
of a one-dimensional grid without storage, cf. Theorem [71 In this case, we will make the weaker 
assumption that the Zi{t) are i.i.d. sub-Gaussian. 



7.1 No storage 

7.1.1 One-dimensional grid 

Theorem 7. Consider a one- dimensional grid without storage, and assume the net productions 
Zi{t) to he i.i.d. sub-Gaussian random variables in the class S{k). (In particular this assumption 
holds if Zi{t) ~ N(^,c72).; 

There exist finite constants kq, ^1,^3 > dependent only on k, such that the following happens. 
For yu < KQa and a < kqC, we have 

etot > I ''1'^^/^ i//x<(j7C, 
° ~ \ /xexp { — K2/^C/cJ^} otherwise. 

Proof. Consider a segment of length £. Let E be the event that the segment has net demand at 
least 3C. Then we have 

P[E] > K3 exp I ^-3^ j , (53) 

for some K3,K4 > 0. (This inequality is immediate for Zi(t) ~ H{fi,a'^) and follows from Lemma 
lA. II proved in the appendix for general random variables in S{k).) 

If E occurs at some time t, this leads to a shortfall of at least C in the segment of length i. This 
shortfall contributes either to ew or to 2ejr, yielding 

2etot > ew + 2e^ > exp | - ^^-^| . (54) 

Choosing i = min (C//i, C^/cj^), we obtain the result. □ 

Note that the lower bound is tight both for jj, > cr^/C (by Theorem El) and < jC (by a 
simple generalization of the same theorem that we omit). 
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7.1.2 Two-dimensional grid 

We prove a lower bound almost matching the upper bound proved in Theorem 31 
Theorem 8. There exists k < oo such that, for C > m.m.{^,a), 

£tot > crexp I - kC^/cj^I . 

Proof. Follows from a single node cutset bound. □ 

We next make a conjecture in probability theory, which, if true, leads to a significantly stronger 
lower bound for small For any set of vertices A of the two-dimensional grid, we denote by dA 
the boundary of A, i.e., the set of edges in the grid that have one endpoint in A and the other in 
A''. 

Conjecture 7.1. There exists 6 > such that the following occurs for all i £ N. Let {X^)^(zs be 
a collection of i.i.d. N(0, 1) random variables indexed by S = {1, x {1, ■■■,£} ^ Z^. Then 



E 



max >^ X, 

^C<Ss.t. ^ 

\aA\<ii "S.A 



>5l\ogl. (55) 



It is not hard to see that this conjecture implies a tight lower bound. 



Theorem 9. Consider the two-dimensional grid without storage, and assume Conjecture 7.1. Then 
there exists k < oo such that for any fi < o" exp(— kC/ct) and C > a we have 

etot > cr exp I - KC/fj| . (56) 



Proof. Consider a square of side i. Conjecture 17.11 yields that we can find a subset of vertices 
in the square with a boundary capacity no more than ACi, but with a net demand of at least 
6a£loge-fif. This yields 

6anogi-4C£- fif 
2etot >ew + > ^—^ ^ . (57) 

Choosing i = exp(«;C/o") with an appropriate choice of k, we obtain the result. □ 
Our conjecture was arrived at based on a heuristic divide-and-conquer argument. We validated 
our conjecture numerically as follows: We obtain a lower bound to the left hand side of Eq. (j55p . 
by maximizing over a restricted class of subsets Sop, consisting of subsets that can be formed by 
dividing the square into two using an oriented path (each step on such a path is either upwards or 
to the right). It is easy to see that if 5 G Sop, then \dS\ < 41. Define 

G{1) = max V Xy , (58) 
5GSop 

where Sop is implicitly a function of I. The advantage of considering this quantity is that G{1) 
can be computed using a simple dynamic program of quadratic complexity. Numerical evidence, 
plotted in Figure [7.1.21 suggests that E[G(Z)] = Q{llogl), which implies our conjecture. 
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Figure 4: Numerical evidence for Conjecture I7.lt The approximate straight line in the plot (cf. 
definition of G{1) in Eq. ([58]) ) suggests the validity of our conjecture. 



7.2 With storage 

7.2.1 One-dimensional grid 

Our approach involves mapping the time evolution of a control scheme in a one-dimensional grid, 
to a feasible (one-time) flow in a two-dimensional grid. One of the dimensions represents 'space' in 
the original grid, whereas the other dimension represents time. 

Consider the one-dimensional grid, with vertex set Z. We construct a two-dimensional 'space- 
time' grid {V,E) consisting of copies of each v € V, one for each time t G Z: define V = {{v,t) : 
u G Z, t E Z}. The edge set E consists of 'space-edges' E^^ and 'time-edges' E^. 

^ EE U E^ 

E'P = {{{v, t), {v + l,t)) -.v £Z,t£ Z} 
E^ = {{{v, t),{v,t + l)) -.v £Z,t£ Z} 

Edges are undirected. Denote by Ce the capacity of e £ E. We define Cg = C for e G i^'^P and 
de = 5/2 for e e E\ 

Given a control scheme for the 1-D grid with storage, we define the fiows in the space-time grid 

as 

= J'{v,v+i]{t) for e = {{v,t), {v + l,t)) e E'P 

Te = B^{t + 1) - 5/2 for e = {{v, t), {v, t + 1)) € E^ 

Notice that these fiows are not subject to Kirchoff constraints, but the following energy balance 
equation is satisfied at each node {v, t) £ V, 



z,{t)-Wiit)-y,it) 



E 



iv,t),{v',t') 



(59) 



{v',t')(^d{v,t) 
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We use performance parameters as before (this definition applies to finite networks and must be 
suitably modified for infinite graphs): 

\hj\ ^ 

' ' eeE 

Notice that eyy is unchanged, and = in our mapping from the 1-D grid with storage to the 
2-D space-time grid. 

Our first theorem provides a rigorous lower bound which is almost tight for the case n = 
Q-o{-\fcsJa^) |-p£_ Theorem [5]). It is proved by considering a rectangular region in the space-time 
grid of side / = max(C/S', 1) in space and T = max(l, 5/C) in time. 

Theorem 10. Suppose < min(C, 5), CS/a^ > max(log((T/min(C, S)), 1). There exists k < oo 
such that 

etot><Texp{-KCS/a^). (60) 



Proof. Consider a segment of length £ = max(C/5, 1) and a sequence of T = iS/C consecutive 
time slots. (Rounding errors are easily dealt with.) The number of nodes in the corresponding 
region TZ in the space-time grid is 

n = IT = max(C, S)/ min(C, S). 

The cut, i.e., the connection between TZ and the rest of the grid, is of size 2{IS +TC) = 4max(C, S). 
The net generation inside TZ is N(n/i, cr^n). Now /u < C by assumption, implying n/i < max(C, S). 
Let E be the event that the net generation inside IZ is at least 5max(C, S). We have 

P[E,, exp (--(!«!) 2 e,p(--;P) 

for some ni < oo. Moreover, E leads to a shortfall of at least max(C, S) over n nodes in the 
space-time grid. It follows that 

^tot > ( max(C, S)/ri) exp ^ — — ^ = min(C, S) exp 

which yields the result, using CSja^ > log(cj/ min(C, 5)). 

□ 

Next we provide a sharp lower bound for small fi using Conjecture 17.11 Recall Theorem [9] 
and notice that its proof does not make any use of Kirchoff flow constraints (encoded in Eq. ([3])). 
Thus, the same result holds for a 2-D space-time grid. We immediately obtain the following result, 
suggesting that the upper bound in Theorem [5] for small fi is tight. 



KlCS 
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Theorem 11. There exists k < oo such that the following occurs if we assume that Conjecture \7. 1\ 
is valid. Consider the one- dimensional grid with parameters C = S > a, and < exp{—KC/a). 
We have 

etot > cr exp { - KvW^} • (61) 

We remark that the requirement C = S can be relaxed if we assume a generalization of Con- 
jecture [7?T] to rectangular regions in the two-dimensional grid. 

7.2.2 Two-dimensional grid 

Theorem 12. There exists a constant k, < oo such that on the two-dimensional grid, 

( kC max(C, 5*) 1 
etot > a exp I J ' | . (62) 

The theorem is proved by considering a single node, using a cutset type argument, similar to 
the proof of Theorem [TOl It implies that the upper bound in Theorem [6] is tight up to constants in 
the exponent. 
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A A probabilistic lemma 

Lemma A.l. Let {Xi,X2, ■ ■ ■ ,Xn, • • • } be a collection of i.i.d. sub- Gaussian random variables in 
S{k) with EXi = 0, E{Xf} = . 

Then there exists finite constants Ki = ki{k,) > 0, K2 = K2{k) > 0,no = f^o(^) depending 
uniquely on k such that, for all n > uq, < 7 < kiO", we have 



p{X:X.>,„}>ie.p{-^}. (63) 

i=l 

Proof. By scaling, we will assume, without loss of generality, cr^ = 1. Throughout the proof 
k',k",... denote constants depending uniquely on k. We will use the same symbol even if the 
constants have to be redefined in the course of the proof. 

For any A € M, let Pa, Ea denote probability and expectation with respect to the measure 
defined implicitly by 

Ea{/(Xi, . . . , = , (64) 

for all measurable functions /. Notice that this measure is well defined for all A by sub-Gaussianity. 

Let g{X) = Ea^i. Then A 1— )■ ^(A) is continuous, monotone increasing with g{0) = 0, ^'(A) = 
VarA(Xi), g"{X) = E\X'l — 'SKxXiE.xXf. Bounding these quantities by sub-Gaussianity, it follows 
that, for < A < «;', we have < VarA(Xi) < k" , and hence 

4<5(A)<k"A. (65) 
k" 
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Define 7+/- = liniA^ioo g{^)- Notice that 7_ < < 7+ and that (the inverse function of g) is 
well defined on the interval (7_,7_|_). 
Define 

MA) - . (66) 

By Taylor expansion, we get ^(A) = 1 - A^E(Xf ) + O(A^) = 1 - A^ + O(A^). Proceeding as above, 
it is not hard to prove that /i(A) > 1 — k"A^ for all < A < k' for some finite constants k', k" > 
(eventually different from above). Finally, for 7 G (7_,7_|_) we define 

H{^) = h{g-\^)). (67) 

Combining the above, we have .^(7) = 1 — k"j'^ for all 7 G [0, k'], and therefore 

H{-f) > e-^^/"^ for all 7 G [0, ki] . (68) 

Now, for 7 G [0, Ki], let E = £(7) be the event that Xi + ■ ■ ■ + X„ > 717. Take A = (7) and 
define •Z'(A) = exp{A^"^j^ Xj}. By Cauchy-Schwarz inequality 



P{E} > 



E{Ie^(A)}^} 
E{Z(A)2 

2 {^z{x)y 



= Fa{E} 



E{Z(A)2} 
P,{E}2i7(7)" 

The proof is completed by noting that Pa{E}2 > 1/4 for all n > no{K), by Berry-Esseen central 
limit theorem (note indeed that, under Pa, Xi,. . . ,Xn have mean 7, variance lower bounded by 
VarA(Xi) >k'>0 and Ex{\Xi\^) < k" < 00). □ 



> PA{E}2exp{-^|. 
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